Design of An Integrated Parallel Processing System with Systolic VLSI Chips
نویسندگان
چکیده
The design of a massively parallel processing system IPU (Integrated Parallel Processing Unit) is described in this paper. It is a two-dimensional mesh-connected parallel proccesing array operated in an SIMD fashion. The current version of the IPU array is implemented with 64 systolic VLSI chips, each of which consists of 4 processing elements (PEs). Each PE consists of a 4-bit ALU, a 64* 4-bit RAM, several 4-bit registers and some multiplexers. Four PEs are implanted in a VLSI systolic chip using 2 micron CMOS technology. The IPU Department System that acts as a bridge between the host computer and the IPU array is also designed. We provide a high-level parallel programming environment for conveniently designing parallel programs for this IPU array. Some experimental results show that the performance of the IPU system is more than one hundred times faster than that of the host computer.
منابع مشابه
Design and Implementation of a High Speed Systolic Serial Multiplier and Squarer for Long Unsigned Integer Using VHDL
A systolic serial multiplier for unsigned numbers is presented which operates without zero words inserted between successive data words, outputs the full product and has only one clock cycle latency. 
The multiplier is based on a modified serial/parallel scheme with two adjacent multiplier cells. Systolic concept is a well-known means of intensive computational task through replication of fu...
متن کاملDesign and Implementation of a High Speed Systolic Serial Multiplier and Squarer for Long Unsigned Integer Using VHDL
A systolic serial multiplier for unsigned numbers is presented which operates without zero words inserted between successive data words, outputs the full product and has only one clock cycle latency. The multiplier is based on a modified serial/parallel scheme with two adjacent multiplier cells. Systolic concept is a well-known means of intensive computational task through replication of func...
متن کاملArchitectural, Numerical and Implementation Issues in the VLSI Design of an Integrated CORDIC-SVD Processor
The Singular Value Decomposition (SVD) is an important matrix factorization with applications in signal processing, image processing and robotics. This thesis presents some of the issues involved in the design of an array of special-purpose processors connected in a mesh, for fast real time computation of the SVD. The systolic array implements the Jacobi method for the SVD. This involves plane ...
متن کاملA Multiprocessor DSP System Using PADDI-2 - Design Automation Conference, 1998. Proceedings
We have integrated an image processing system built around PADDI-2, a custom 48 node MIMD parallel DSP. The system includes image processing algorithms, a graphical SFG tool, a simulator, routing tools, compilers, hardware configuration and debugging tools, application development libraries, and software implementations for hardware verification. The system board, connected to a SPARCstation vi...
متن کاملUnified VLSI systolic array design for LZ data compression
Hardware implementation of data compression algorithms is receiving increasing attention due to exponentially expanding network traffic and digital data storage usage. In this paper, we propose several serial one-dimensional and parallel two-dimensional systolic-arrays for Lempel–Ziv data compression. A VLSI chip implementing our optimal linear array is fabricated and tested. The proposed array...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Inf. Sci. Eng.
دوره 7 شماره
صفحات -
تاریخ انتشار 1991